Material for : Batched Bandit Problems

نویسندگان

  • Vianney Perchet
  • Philippe Rigollet
  • Sylvain Chassang
  • Erik Snowberg
چکیده

Motivated by practical applications, chiefly clinical trials, we study the regret achievable for stochastic bandits under the constraint that the employed policy must split trials into a small number of batches. We propose a simple policy, and show that a very small number of batches gives close to minimax optimal regret bounds. As a byproduct, we derive optimal policies with low switching cost for stochastic bandits. In this supplementary material we compare, in simulations, the various policies (grids) introduced in [PRCS15]. These are also compared withUcb2 [ACBF02], which, as noted in [PRCS15], can be seen as an M batch trial with M = Θ(log T ). The simulations are based both on data drawn from standard distributions, and from a real medical trial: specifically, data from Project AWARE, an intervention that sought to reduce the rate of sexually transmitted infections (STI) among high-risk individuals [MFGea13]. Of the three policies introduced here, the minimax grid often does the best at minimizing regret. While all three policies are often bested by Ucb2, it is important to note that the latter algorithm uses an order of magnitude more batches. This makes using Ucb2 for medical trials functionally impossible. For example, in the real data we examine, the data on STI status was not reliably available until at least six months after the intervention. Thus, a three-batch trial would take 1.5 years to run—as intervention and data collection would need to take place three times, six months apart. However, in contrast, Ucb2 would use as many as 56 batches—meaning the overall experiment would take at least 28 years. Despite this extreme difference Supported by ANR grant ANR-13-JS01-0004. Supported by NSF grants DMS-1317308, CAREER-DMS-1053987. Supported by NSF grant SES-1156154. AMS 2000 subject classifications: Primary 62L05; secondary 62C20

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Batched Bandit Problems

Motivated by practical applications, chiefly clinical trials, we study the regret achievable for stochastic bandits under the constraint that the employed policy must split trials into a small number of batches. We propose a simple policy that operates under this contraint and show that a very small number of batches gives close to minimax optimal regret bounds. As a byproduct, we derive optima...

متن کامل

Design of a Hybrid Genetic Algorithm for Parallel Machines Scheduling to Minimize Job Tardiness and Machine Deteriorating Costs with Deteriorating Jobs in a Batched Delivery System

This paper studies the parallel machine scheduling problem subject to machine and job deterioration in a batched delivery system. By the machine deterioration effect, we mean that each machine deteriorates over time, at a different rate. Moreover, job processing times are increasing functions of their starting times and follow a simple linear deterioration. The objective functions are minimizin...

متن کامل

Generalized Bandit Problems

1 The questions addressed in this paper grew out of my work with Jeff Banks on bandit problems and their applications (Banks and Sundaram (1992a, 1992b, 1994)) and owe much to many discussions I had with him on this subject. I also had the benefit of several discussions with Andy McLennan, especially regarding the material in Sections 4 and 6 of this paper.

متن کامل

MAGMA Batched: A Batched BLAS Approach for Small Matrix Factorizations and Applications on GPUs

A particularly challenging class of problems arising in many applications, called batched problems, involves linear algebra operations on many small-sized matrices. We proposed and designed batched BLAS (Basic Linear Algebra Subroutines), Level-2 GEMV and Level-3 GEMM, to solve them. We illustrate how to optimize batched GEMV and GEMM to assist batched advance factorization (e.g. bi-diagonaliza...

متن کامل

Optimizing the SVD Bidiagonalization Process for a Batch of Small Matrices

A challenging class of problems arising in many GPU applications, called batched problems, involves linear algebra operations on many small-sized matrices. We designed batched BLAS (Basic Linear Algebra Subroutines) routines, and in particular the Level-2 BLAS GEMV and the Level-3 BLAS GEMM routines, to solve them. We proposed device functions and big-tile settings in our batched BLAS design. W...

متن کامل

Particle swarm optimization for minimizing total earliness/tardiness costs of two-stage assembly flowshop scheduling problem in a batched delivery system

This paper considers a two-stage assembly flow shop scheduling problem. When all parts of each product are completed in the first stage, they are assembled into a final product on an assembly machine in the second stage. In order to reduce the delivery cost, completed products can be held until completion of some other products to be delivered in a same batch. The proposed problem addresses sch...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015